feat: support joyai-llm-flash model on npu device. #1121

Merged
XuZhang99 merged 5 commits into jd-opensource:main from longhui-z:main
Apr 2, 2026
@longhui-z
Contributor

No description provided.

@XuZhang99 changed the title from "feat:support joyai-lllm-flash model (for npu)." to "feat:support joyai-llm-flash model for npu device." on Mar 27, 2026
Contributor

@gemini-code-assist (bot) left a comment

Code Review

This pull request introduces the JoyAILLMFlash model implementation for NPU, including its core logic, weight loading mechanisms, and configuration arguments. It also optimizes expert weight merging in the DeepSeek-V2 loader to support quantized formats. Feedback focuses on improving code maintainability and correctness by replacing magic numbers with named constants, using dynamic calculations for model dimensions instead of hardcoded values, and fixing signed/unsigned comparison warnings in loops.
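For readers without the diff at hand, the three kinds of feedback can be illustrated with a minimal sketch. Every name below (`kDefaultNumExperts`, `ModelDims`, `head_dim`, `sum_expert_sizes`) and every value is hypothetical and chosen for illustration only; none of it is taken from this PR's code.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical names and values throughout; nothing here comes from the PR diff.

// 1. A named constant instead of a bare literal repeated at call sites.
constexpr int64_t kDefaultNumExperts = 8;

struct ModelDims {
  int64_t hidden_size;
  int64_t num_heads;
};

// 2. Derive the per-head dimension from the configuration rather than
//    hardcoding it, and check that the division is exact.
inline int64_t head_dim(const ModelDims& dims) {
  assert(dims.num_heads > 0 && dims.hidden_size % dims.num_heads == 0);
  return dims.hidden_size / dims.num_heads;
}

// 3. Use std::size_t for the loop index so it matches the type returned by
//    std::vector::size(), which avoids signed/unsigned comparison warnings.
inline int64_t sum_expert_sizes(const std::vector<int64_t>& sizes) {
  int64_t total = 0;
  for (std::size_t i = 0; i < sizes.size(); ++i) {
    total += sizes[i];
  }
  return total;
}
```

A range-based `for` loop would sidestep the index type entirely; the indexed form is shown only because the review mentions comparisons in loops.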

@longhui-z requested a review from XuZhang99 on March 27, 2026 07:28
XuZhang99 previously approved these changes Mar 27, 2026
yq33victor previously approved these changes Mar 28, 2026
Collaborator

@yq33victor left a comment

LGTM

XuZhang99 previously approved these changes Mar 28, 2026
@liutongxuan changed the title from "feat:support joyai-llm-flash model for npu device." to "feat: support joyai-llm-flash model on npu device." on Mar 30, 2026
@longhui-z dismissed stale reviews from XuZhang99 and yq33victor via a8715d5 on March 30, 2026 06:31
XuZhang99 previously approved these changes Mar 30, 2026
yq33victor previously approved these changes Mar 30, 2026
Collaborator

@yq33victor left a comment

LGTM

@longhui-z requested a review from yq33victor on April 1, 2026 06:04
@longhui-z requested a review from RobbieLeung on April 1, 2026 11:11
Collaborator

@yq33victor left a comment

LGTM

@XuZhang99 merged commit e539c93 into jd-opensource:main on Apr 2, 2026
24 of 39 checks passed
DongheJin pushed a commit to DongheJin/xllm that referenced this pull request on Apr 3, 2026
4 participants